Improving Speech Understanding through Integration of Prosody and Syntax
Abstract
As automatic speech recognition technology improves and is applied to more difficult tasks, the need to integrate it with syntactic, semantic, pragmatic and knowledge-based system components will increase. This paper describes the development of models that relate prosodic phrasing to syntactic structure. These models are useful for understanding the linguistic aspects of the prosody-syntax relationship and have potential for improving the performance of speech recognition systems. There are three important results from these models. Firstly, a link grammar provides an effective syntactic framework for predicting prosody that is simpler and more accurate than conventional constituent analysis. Secondly, within the link grammar framework, the left syntactic context is important for predicting prosodic structure while the right context is not significant. Thirdly, the models lead to a novel extension to the n-gram stochastic language model mechanism which incorporates prosodic constraints in a way that can be directly integrated with HMM-based speech recognition.
Similar resources
Reading aloud: eye movements and prosody
This study aims to connect data from ocular movements and reading aloud speech to syntactic and discursive properties of texts, in order to understand integrative cognitive processes during reading for understanding and to identify prosodic and eye movements’ indicators of reading fluency. Assuming that in reading aloud there is a close interaction between syntax structure and speech prosody, w...
Improving the Robustness of Prosody Dependent Language Modeling Based on Prosody Syntax Dependence
This paper presents a novel approach that improves the robustness of prosody dependent language modeling by leveraging the dependence between prosody and syntax. A prosody dependent language model describes the joint probability distribution of concurrent word and prosody sequences and can be used to provide prior language constraints in a prosody dependent speech recognizer. Robust Maximum Lik...
Towards an Integration of Speech and Natural Language Processing
Man-machine communication with natural language requires the integration of heterogeneous knowledge within a homogeneous framework. This problem particularly concerns the speech/natural language interface. We propose in this paper an HPSG view of prosody illustrated with French intonation phenomena. We show that this linguistic theory is adapted for the integration representation: typed f...
Session 13: Prosody
The aim of this introductory section is to set the context for Session 13: Prosody. It will do so by defining some basic terms, by considering the status of current research on prosody, and by outlining the papers in the session and how they contribute to and complement previous work in the area. Prosody, perceptually, can be thought of as the relative temporal groupings of words and the relativ...
Employing Sentence Structure: Syntax Trees as Prosody Generators
In this paper, we describe a prosody generation system for speech synthesis that makes direct use of syntax trees to obtain duration and pitch. Instead of transforming the tree through special rules or extracting isolated features from the tree, we make use of the tree structure itself to construct a superpositional model that is able to learn the relation between syntax and prosody. We impleme...
Publication date: 1994